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Abstract 

We study a particular type of subcritical Galton- Watson trees, which are called non- 
generic trees in the physics community. In contrast with the critical or supercritical case, it 
is known that condensation appears in certain large conditioned non-generic trees, meaning 
that with high probability there exists a unique vertex with macroscopic degree comparable 
to the total size of the tree. We investigate this phenomenon by studying scaling limits 
of such trees. Using recent results concerning subexponential distributions, we study the 
convergence of three functions coding these trees (the Lukasiewicz path, the contour function 
and the height function) and show that the situation is completely different from the critical 
case. In particular, the height of such trees grows logarithmically in their size. We also 
study fluctuations around the condensation vertex. 

Keywords. Condensation, Subcritical Galton- Watson trees, Scaling limits, Subexpo- 
nential DISTRIBUTIONS. 
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Figure 1: The first figure shows a large non-generic Galton-Watson tree. The second 
figure shows a large critical Galton-Watson tree with finite variance. 
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Introduction 



The behavior of large Galton- Watson trees whose offspring distribution fi = (/ij)«>o is critical 
(meaning that the mean of /x is 1) and has finite variance has drawn a lot of attention. If 
t n is a Galton- Watson tree with offspring distribution /i (in short a GW M tree) conditioned on 
having total size n, Kesten [23] proved that t n converges locally in distribution as n — > oo to 
the so-called critical Galton- Watson tree conditioned to survive. Aldous [1] studied the scaled 
asymptotic behavior of t n by showing that the appropriately rescaled contour function of in 
converges to the Brownian excursion. 

These results have been extended in different directions. The "finite second moment" condition 
on /i has been relaxed by Duquesne [11], who showed that when /i belongs to the domain of 
attraction of a stable law of index 9 G (1,2], the appropriately rescaled contour function of tn 
converges toward the normalized excursion of the ^-stable height process, which codes the so- 
called ^-stable tree (see also [25]). In a different direction, several authors have considered trees 
conditioned by other quantities than the total size, for example by the height [24, 29] or the 
number of leaves [32, 26]. 

Non critical Galton- Watson trees. Kennedy [22] noticed that, under certain conditions, 
the study of non-critical offspring distributions reduces to the study of critical ones. More 
precisely, if A > is a fixed parameter such that Z\ = Y,i>o < 00 > se ^ /4 — Hi^ l /Z\ for 
i > 0. Then a GW M tree conditioned on having total size n has the same distribution as a GW^*) 
tree conditioned on having total size n. Thus, if one can find A > such that both Z\ < oo 
and /i( A ) is critical, then studying a conditioned non-critical Galton- Watson tree boils down to 
studying a critical one. This explains why the critical case has been extensively studied in the 
literature. 

Let /i be a probability distribution such that /io > and /i^ > for some k > 2. We are 
interested in the case where there exist no A > such that both Z\ < oo and is critical (see 
[18, Section 8] for a characterization of such probability distributions). An important example 
is when \x is subcritical (i.e. of mean strictly less than 1) and /ij ~ c/i° as i — > oo for a fixed 
parameter /3 > 2. The study of such GW M trees conditioned on having a large fixed size was 
initiated only recently by Jonsson & Stefansson [21] who called such trees non-generic trees. 
They studied the above-mentioned case where /ij ~ c/i 13 as i — > oo, with /3 > 2, and showed 
that if t n is a GW M tree conditioned on having total size n, then with probability tending to 1 as 
n — > oo, there exists a unique vertex of t n with maximal degree, which is asymptotic to (1 — m)n 
where m < 1 is the mean of fi. This phenomenon is called condensation and appears in a variety 
of statistical mechanics models such as the Bose-Einstein condensation for bosons, the zero-range 
process [20, 13] or the Backgammon model [4] (see Fig. 1). 

Jonsson and Stefansson [21] have also constructed an infinite random tree T (with a unique 
vertex of infinite degree) such that t n converges locally in distribution toward T (meaning roughly 
that the degree of every vertex of t n converges toward the degree of the corresponding vertex of 
T). See Proposition 2.7 below for the description of T. In [18], Janson has extended this result to 
simply generated trees and has in particular given a very precise description of local properties 
of Galton- Watson trees conditioned on their size. 

In this work, we are interested in the existence of scaling limits for the random trees t n . When 
scaling limits exist, one often gets information concerning the global structure of the tree. 



2 



Notation and assumptions. Throughout this work 9 > 1 will be a fixed parameter. We 
say that a probability distribution (fJ>j)j>o on the nonnegative integers satisfies Assumption (He) 
if the following two conditions hold: 



oo 



/i is subcritical, meaning that < }^ < 1. 

j=0 



(ii) There exists a function L : IR + — > IR + such that £(x) > for x large enough and 
lim^oo £(tx)/L(x) = 1 for all t > (such a function is called slowly varying) and 
fi n = L(n)/n 1+e for every n > 1. 

Let £(t) be the total progeny or size of a tree r. Condition (ii) implies that [((t) = n] > 
for sufficiently large n. Note that (ii) is more general than the analogous assumption in [21, 18], 
where only the case L(x) — > c as x — > oo was studied in detail. Throughout this text, 9 > 1 is 
a fixed parameter and /i is a probability distribution on N satisfying the Assumption (He). In 
addition, for every n > 1 such that P M [C(t) = n] > 0, t n is a GW M tree conditioned on having n 
vertices (note that t n is well defined for n sufficiently large). The mean of /i will be denoted by 
m and we set 7 = 1 — m. 

We are now ready to state our main results which concern different aspect of non-generic trees. 
We are first interested in the condensation phenomenon and derive properties of the maximal 
degree. We then find the location of the vertex of maximal degree. Finally we investigate the 
global behavior of non-generic trees by studying their height. 

Condensation. If r is a (finite) tree, we denote by A(r) the maximal out-degree of a vertex 
of r (the out-degree of a vertex is by definition its number of children). If r is a finite tree, let 
u*(t) be the smallest vertex (in the lexicographical order, see Definition 1.3 below) of r with 
maximal out-degree. The following result states that, with probability tending to 1 as n — > 00, 
there exists a vertex of t n with out-degree roughly and that the deviations around this typical 
value are of order roughly n 1 ^ 2 ^, and also that the out-degrees of all the other vertices of in 
are roughly of order 72 1 /( 2A6) ). In particular the vertex with maximal out-degree is unique with 
probability tending to 1 as n — > 00. 

Theorem 1. There exists a slowly varying function L such that if B n = L(n)n 1 ^ 2Ae \ the fol- 
lowing assertions hold: 



(ii) With probability tending to 1 as n — )■ oo ; the out-degrees of all the vertices of i n , except 
u*(in), are 0(B n ). 

(Hi) Let (l*)t>o be a spectrally positive Levy process with Laplace exponent E[exp(— Alt)] = 



When /i has finite variance a 2 G (0, 00), one may take B n = a^Jn/2. Theorem 1 has already 
been proved when [i n ~ c/n 1+e as n — > 00 (that is when L = c + o(l), in which case one may 




exp(tA 2Ae ). Then 




A(tn) - 7n (d) 
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choose L to be a constant function) by Jonsson & Stefansson [21] for (i) and Janson [18] for (ii) 
and (iii). However, our techniques are different and are based on a coding of t„ by a conditioned 
random walk combined with recent results of Armendariz & Loulakis [2] concerning random 
walks whose jump distribution is subexponential. The main advantage of this approach is that 
it enables us to obtain new results concerning the structure of t n . 

Localization of the vertex of maximal degree. We are also interested in the location 
of the vertex of maximal degree u+(t n ). Before stating our results, we need to introduce some 
notation. If r is a tree, let U(t) be the index in the lexicographical order of the first vertex of in 
with maximal out-degree (when the vertices of r are ordered starting from index 0). Note that 
the number of children of tt*(r) is A(r). Denote the generation of tt*(r) by |w*(t)|. Finally, if 
u G r, the pointed tree r cut at u is by definition the subtree of r obtained by removing all the 
descendants of u, pointed at u (meaning that the vertex u is distinguished). 

Theorem 2. The following three convergences hold: 

(i) For i > 0, P [[/(O = i\ — > 7 -P M [C(r) >z + l]. 

(ii) For i > 0, P [MO I = l \ — ► (1 - m ) m *- 

n— too 

(Hi) As n — > oo, the tree i n cut at tt*(0 converges locally in distribution toward the tree 7 cut 
at its vertex of infinite degree. 

See Proposition 2.7 below for the description of T. Using completely different techniques, a result 
similar to assertion (i) has been proved by Durrett [12, Theorem 3.2] in the context of random 
walks when i n is a GW M tree conditioned on having at least n vertices and in addition /i has finite 
variance. However, the so-called local conditioning by having a fixed number of vertices is often 
much more difficult to analyze (see e.g. [11, 29]). Note that Ya>\ P/j [C( r ) > ^] = E M [C( r )] — V7; 
so that the limit in (i) is a probability distribution. The proof of (i) combines the coding of in 
by a conditioned random walk with the use of local limit theorems. The proof of the other two 
assertions uses (i) together with the local convergence of tn toward the infinite random tree T, 
which has been obtained by Jonsson & Stefansson [21] in a particular case and then generalized 
by Janson [18], and was already mentioned above. 

A key consequence of Theorem 2 is that, informally, the tree t n looks like a finite spine of 
geometric length decorated with independent GW M trees, and on top of which are grafted A(t„) 
independent GW M trees. 

We are also interested in the size of the subtrees grafted on u*(0- If t is a tree, for 1 < 
j < A(r), let Cj( r ) be the number of descendants of the j-th child of «*(r) and set Zjir) = 
Ci(t) + C2( r ) + ■•• + Cj( r )- If I is an interval, we let 3(1, M.) denote the space of all right- 
continuous with left limits (cadlag) functions I — > K, endowed with the Skorokhod Jj-topology 
(see [5, chap. 3] and [16, chap. VI] for background concerning the Skorokhod topology). If 
x G R, let |_^J denote the greatest integer smaller than or equal to x. Recall that (Y t )t>o is the 
spectrally positive Levy process with Laplace exponent E[exp(— Alt)] — exp(tA 2A6 *). Recall the 
sequence (_B n ) n >! from Theorem 1. 
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Theorem 3. The following convergence holds in distribution in D([0, 

( z ^ (t ^- A(tw)t/7 ,o<t<i) ^ (iy tJ o<*<i). 

Note that in the case when fi has finite variance, we have 9 > 2 and Y is just a constant times 
standard Brownian motion. Let us mention that Theorem 3 is used in [19] to study scaling limits 
of random planar maps with a unique large face (see [19, Proposition 3.1]) and is also used in [8] 
to study the shape of large supercritical percolation clusters on random triangulations. 

Corollary 1. If > 2, max 1 < i <A(t n ) Ci{Ui)/B n converges in probability toward as n — > oo. If 

9 < 2, for every u > we have: 

exp (?f(b«r"*) • 

where T is Euler's Gamma function. 

Height of non-generic trees. One of the main contributions of this work it to understand 
the growth of the height CK(t n ) of t n , which is by definition the maximal generation in t n . We 
establish the key fact that J~C(t n ) grows logarithmically in n: 

Theorem 4. For every sequence (A n ) n >i of positive real numbers tending to infinity: 

P 

Note that the situation is completely different from the critical case, where !K(t„) grows like a 
power of n. Theorem 4 implies that CK(t n )/ln(n) — > ln(l/m) in probability as n — > oo, thus 
partially answering Problem 20.7 in [18]. Theorem 4 can be intuitively explained by the fact 
that the height of t n should be close to the maximum of the height of jn independent subcritical 
GW M trees, which is indeed of order ln(n). 

Since t n grows roughly as ln(n) as n — » oo, it is natural to wonder if one could hope to obtain 
a scaling limit after rescaling the distances in t„, by ln(n). We show that the answer is negative 
and that we cannot hope to obtain a nontrivial scaling limit for t n for the Gromov-Hausdorff 
topology, in sharp contrast with the critical case (see [11]). This partially answers a question of 
Janson [18, Problem 20.11]. 

Theorem 5. The sequence (ln(n) -1 •t n ) n >i is not tight for the Gromov-Hausdorff topology, where 
ln(n) -1 • tn stands for the metric space obtained from t n by multiplying all distances by ln(n) -1 . 

The Gromov-Hausdorff topology is the topology on compact metric spaces (up to isometries) 
defined by the Gromov-Hausdorff distance, and is often used in the study of scaling limits of 
different classes of random graphs (see [7, Chapter 7] for background concerning the Gromov- 
Hausdorff topology). 

However, we establish the convergence of the finite-dimensional marginal distributions of the 
height function coding t n . If r is a tree, for < i < £(r) — 1, denote by Hi(r) the generation of 
the i-ih vertex of r in the lexicographical order. 



P — max G(tn) < u 



ln(n) 



ln(l/m) 



< Xr 
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Theorem 6. Let k > 1 be an integer and fix < t\ < • • • < < 1. Then 

(-^L«*ij( t ™)'- ff N2j( t n), • • • ,H [ntki (t n )) (1 + e + e^ 1 + e + e 2 , . . . , 1 + e + e k ), 

where (ei)j> is a sequence of i.i.d. geometric random variables of parameter 1 — m (that is 
P [e = i] = (1 - m)m i for i>0). 

Informally, the random variable eo describes the length of the spine, and the random variables 
(ex, ... , efc) describe the height of vertices chosen in a forest of independent subcritical GW M trees. 
Note that these finite-dimensional marginal distributions converge without scaling, even though 
the height of tn is of order ln(n). 

This text is organized as follows. We first recall the definition and basic properties of Galton- 
Watson trees. In Section 2, we establish limit theorems for large conditioned non-generic Galton- 
Watson trees. We conclude by giving possible extensions and formulating some open problems. 

Acknowledgments. I am grateful to Nicolas Broutin for interesting remarks, to Gregory 
Miermont for a careful reading of a preliminary version of this work, to Olivier Henard for Remark 
2.3, to Pierre Bertin and Nicolas Curien for useful discussions and to Jean-Frangois Le Gall for 
many useful comments. 
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1 Galton- Watson trees 
1.1 Basic definitions 

We briefly recall the formalism of plane trees (also known in the literature as rooted ordered 
trees) which can be found in [28] for example. 
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Definition 1.1. Let N = {0, 1, 2, . . .} be the set of all nonnegative integers and let N* be the set 
of all positive integers. Let also U be the set of all labels defined by: 

oo 

U = U (M*) n 

n=0 

where by convention (N*)° = {0}. An element of U is a sequence u = U\ • • • of positive integers 
and we set \u\ = k, which represents the "generation" of u. If u = U\ - • -Ui and v = v% ■ ■ -Vj 
belong to U, we write uv — U\ • ■ ■ UiV\ • ■ ■ Vj for the concatenation of u and v. In particular, we 
have u® = 0ii = u. Finally, a plane tree r is a finite or infinite subset of U such that: 

(i) e r, 

(ii) if v G r and v = ui for some z G N*, then tt G r, 

(iii) for every u G r, there exists fc u (r) G {0, 1, 2, . . .} U {oo} (the number of children of u) such 
that, for every j G W, uj G r if and only if 1 < j < k u {r). 

Note that in contrast with [27, 28] we allow the possibility k u (r) = oo in (iii). In the following, 
by tree we will always mean plane tree, and we denote the set of all trees by T. We will often 
view each vertex of a tree r as an individual of a population whose r is the genealogical tree. 
The total progeny or size of r will be denoted by £(r) = Card(r). If r is a tree and u G r, we 
define the shift of r at u by T u r — {v G U; uv G r}, which is itself a tree. 

Definition 1.2. Let p be a probability measure on N. The law of the Galton- Watson tree with 
offspring distribution p is the unique probability measure P p on T such that: 

(i) F p [k9=j]=p(j)foTj>0, 

(ii) for every j > 1 with p(j) > 0, conditionally on {k = j}, the subtrees Tit, . . . ,TjT are 
independent and identically distributed with distribution F p . 

A random tree whose distribution is P p will be called a Galton- Watson tree with offspring dis- 
tribution p, or in short a GW p tree. 

In the sequel, for every integer j > 1, P p j will denote the probability measure on TP which is 
the distribution of j independent GW P trees. The canonical element of T- 7 is denoted by f. For 
f = (n, . . . , Tj) G T j , let C(f) = C(tl) + • • • + Cfo) be the total progeny of f. 

1.2 Coding Galton- Watson trees 

We now explain how trees can be coded by three different functions. These codings are important 
in the understanding of large Galton- Watson trees. 

Definition 1.3. We write u < v for the lexicographical order on the labels U (for example 
< 1 < 21 < 22). Let r be a finite tree and order the individuals of r in lexicographical order: 
= u (0) < w (l) < ••• < u(C(r) - 1). The height process H(t) = (H n (r),0 <n< ((r)) is 
defined, for < n < ((t), by: 

ff»(T) = |u(n)|. 

We set Hrt T )( T ) = f° r technical reasons. The height CK(r) of r is by definition max < n<< j( T ) H n {r). 
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Figure 3: The Lukasiewicz path W(r) and the height function H{r) of r. 

Consider a particle that starts from the root and visits continuously all the edges of r at unit 
speed, assuming that every edge has unit length. When the particle leaves a vertex, it moves 
toward the first non visited child of this vertex if there is such a child, or returns to the parent 
of this vertex. Since all the edges are crossed twice, the total time needed to explore the tree is 
2(C(r) — 1). For < t < 2(C(r) — 1), C T (t) is defined as the distance to the root of the position 
of the particle at time t. For technical reasons, we set C T (t) = for t G [2(C(r) — 1), 2((t)]. The 
function C(r) is called the contour function of the tree r. See Figure 3 for an example, and [11, 
Section 2] for a rigorous definition. 

Finally, the Lukasiewicz path W(r) = (W n (r),0 < n < C( r )) °f T is defined by W (t) = 
and for < n < £(r) — 1: 

W t H-i(r) = W ft (r) + fe u(n) (r)-l. 

Note that necessarily W^( T )(r) = — 1 and that U(t n ) = min{j > 0; W J+1 (t n ) — Wj(t n ) = 
A(t n ) — 1}, where we recall that U{r) is the index in the lexicographical order of the first vertex 
of t n with maximal out-degree. 

The following proposition explains the importance of the Lukasiewicz path. Let p be a critical 
or subcritical probability distribution on N with p(l) < 1. 

Proposition 1.4. Let (W n ) n >o be a random walk with starting point Wq = and jump distribu- 
tion v(k) = p(k + 1) for k > -1. Set ( = inf{n > 0; W n = -1}. Then (W , W x , ■ ■ ■ , W ( ) has 
the same distribution as the Lukasiewicz path of a GW p tree. In particular, the total progeny of 
a GW P tree has the same law as (. 

Proof. See [27, Proposition 1.5]. □ 
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1.3 The Vervaat transformation 



For x = (xi, . . . ,x n ) G Z n and i G Z/nZ, denote by x'*' the i-th cyclic shift of x defined by 

X k = X i+k mod n f° r 1 ___ & < 7i. 

Definition 1.5. Let n > 1 be an integer and let x = (xi, . . . , x n ) G Z™. Set Wj = X\ + ■ • • + Xj 

for 1 < j < n and let the integer «*(x) be defined by z*(x) = inf{j > l;wj = mini<j< n Wi}. The 
Vervaat transform of x, denoted by V(x), is defined to be x***^. 

The following fact is well known (see e.g. [30, Section 5]): 

Proposition 1.6. Let (W n ,n > 0) be as in Proposition 1.4 and X^ = Wk — Wk-i for k > 1. Fix 
an integer n > 1 such that P [W n = -1] > 0. The law of V(X 1 , . . . , X n ) under P [• | W n — — 1] 
coincides with the law of (X±, . . . , X n ) under P [ • | ( = n] . 

From Proposition 1.4, it follows that the law of V(X 1? . . . ,X n ) under P [• | W n = — 1] coin- 
cides with the law of (W^tn), W 2 (t„) — Wi(t„), . . . , W n (t n ) — W n _x(t n )) where t n is a GW P tree 
conditioned on having total progeny equal to n. 

We now introduce the Vervaat transformation in continuous time. 



Definition 1.7. Set D ([0,1],R) = {to G D([0,1],R); u(0) = 0}. The Vervaat transformation 
in continuous time, denoted by V : Do([0, 1],M) — > D([0, 1],M), is defined as follows. For u> G 
D ([0, 1],R), set gx(u) = inf{t G [0, 1]; u(t-) A u(t) = inf [0 ,i]a;}. Then define: 



By combining the Vervaat transformation with limit theorems under the conditional prob- 
ability distribution P[ - | W n = —1] and using Proposition 1.4 we will obtain information about 
conditioned Galton- Watson trees. The advantage of dealing with P [ ■ | W n = — 1] is to avoid any 
positivity constraint. 

1.4 Slowly varying functions 

Recall that a function L : R + — > R + is said to be slowly varying if L(x) > for x large enough 
and linXj.__i.oo L(tx)/L(x) = 1 for all t > 0. Let L : R + — > R + be a slowly varying function. 
Without further notice, we will use the following standard facts: 

(i) The convergence lim^oo L(tx)/L(x) = 1 holds uniformly for t in a compact subset of 



(ii) Fix e > 0. There exists a constant C > 1 such that -^x e < L{nx)/L{n) < Cx 6 for every 
integer n sufficiently large and x > 1. 

These results are immediate consequences of the so-called representation theorem for slowly 
varying functions (see e.g. [6, Theorem 1.3.1]). 




if gx{oj)+t < 1, 
if gi(u) + t>l. 



(0,oo). 
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2 Limit theorems for conditioned non-generic Galton- Watson 
trees 

In the sequel, (W n ; n > 0) denotes the random walk introduced in Proposition 1.4 with p = \i. 
Note that E [Wi] = -7 < 0. Set X = and X fc = W k - W k -i for fc > 1. It will be convenient 
to work with centered random walks, so we also set W n = W n + and X n = X n + 7 for n > 
so that W n = X\ + • • • + X n . Obviously, W n = —1 if and only if W n = 771—1. 



2.1 Invariance principles for conditioned random walks 

In this section, our goal is to prove Theorem 1. We first introduce some notation. Denote by 
T : U„>ilR n U n >ilR n the operator that interchanges the last and the (first) maximal component 
of a finite sequence of real numbers: 



T(xi, . . .,x n ) k = < 



max Xn if k = n 

l<i<n 

x n if x k > max Xj and x k = max Xi 

l<i<k k<i<n 

Xk otherwise. 



Since \i satisfies Assumption (He), we have P [Wi G (x,x + l]j ~ L(x)/x 1+e as x — >■ 00. 
Then, by [10, Theorem 9.1], we have: 

P [W n e(x,x + 1]] ~ nP [Wi G (x, x + 1]] , (1) 

uniformly in x > en for every fixed e > 0. In other words, the distribution of W\ is (0, 1]- 
subexponential, so that we can apply a recent result of Armendariz & Loulakis [2] concerning 
conditioned random walks with subexponential jump distribution. In our particular case, this 
result can be stated as follows: 



(Li- 



Proposition 2.1 (Armendariz & Loulakis, Theorem 1 in [2]). For n > 1 and x > 0, let fi 

be the probability measure on W 1 which is the distribution of (Xi, . . . ,X n ) under the conditional 
probability distribution P [• | W n G (x,x+ 1] 
Then for every e > 0, we have: 



lim sup sup ii njX o T- 1 [A x R] - ^ n ' 1] [A] 

n ^°° x>en AeSSpR"- 1 ) 



0. 



As explained in [2], this means that under P [ • | W n G (x, x + 1]] , asymptotically one gets n — 1 
independent random variables after forgetting the largest jump. 

The proof of Theorem 1 is based on the following invariance principle concerning a conditioned 
random walk with negative drift, which is a simple consequence of Proposition 2.1. 

Proposition 2.2. Let U be a uniformly distributed random variable on [0, 1]. Then the following 
convergence holds in D([0, 1],R): 



^,0<t<l 
n 



W n = -1) ^ (-7* + 7%<*,0<t<l). (2) 



n— > 00 
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Proof. By the definition of W, it is sufficient to check that the following convergence holds in 
D([0,1],R): 



W 



\nt\ 



< t < 1 



W n = jn — 1 



n— >oo 



(7lt/<t,0<t<l) 



(3) 



where {7 is a uniformly distributed random variable on [0, 1]. Denote by V n the coordinate of the 
first maximal component of (X 1; . . . , X n ). Set Wq = and for 1 < % < n — 1 set: 



W 



X 1 + X 2 + ■ ■ ■ + Xi if % < V n 

Xi + X 2 H h X Vn -i + X Vn +i H h Xi+i otherwise. 



By Proposition 2.1, for every e > 0: 



lim 


P 


Vt e [0,1], 




< e 


n— >oo 




n — 1 





^ = 7^-1 



P 



We [0,1], 



L(n-l)tJ 



72—1 



< e 



0. (4) 



We now claim that for every e > 0: 



P 



We [0,1], 



n — 1 



< e 



(5) 



To establish (5), note that (|Wi|)i>o is a submartingale, so that Doob's maximal inequality 

entails: _ 

E[|WVi| 



P 



max Wi > (n — l)e 

Ki<n-1 



< 



By [31], E [|W„-i|] /(n-l)^0asn-> oo. This shows (5). 

Combining (5) with (4), we get the following convergence in D([0, 1], 



Wi 



L("-i)«J 
n — 1 



< t < 1 



W n = 7n - 1 



0. 



(6) 



where stands for the constant function equal to on [0, 1]. In addition, note that on the event 
{W„ = 771— 1}, we have Xy n = — 1 — W[ n _ij . The following joint convergence in distribution 
thus holds in D([0, 1], E) ® R: 



W V{n-l)t\ . Q < t < 1 \ X Vn 



n — 1 



n 



W n = 'jn-1 



(0,7)- 



(7) 



Standard properties of the Skorokhod topology then show that the following convergence holds 
in D([0,1],R): 



W 



\nt\ X Vj 



n 



n 



l {t >^ } ;0< t < 1 



W n = >yn - 1 



0. 



Next, note that the convergence (7) implies that under P [ • | W n = 771 — ll , (X±, . . . , X n ) has a 
unique maximal component with probability tending to one as n — > 00. Since the distribution of 
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(X 1 ,...,X n ) under P[-| W n — 777 — l] is cyclically exchangeable, one easily gets that the law 
of V n /n under P [ • | W n = 7n — l] converges to the uniform distribution on [0, 1]. Also from (7) 
we know that Xy n /n under P [ • | W n = 777 — l] converges in probability to 7. It follows that 



-rl 



,0 < t < 1 



772 — 1 



00 

n— >oo 



(7ltr<t,0<t<l), 



(9) 



where U is uniformly over [0, 1]. Since (8) holds in probability, we can combine (8) and (9) to 
get (3). This completes the proof. □ 

Using completely different techniques, a result similar to Proposition 2.2 has been proved by 
Durrett [12, Theorem 3.2] when the conditioning by the event {W n = —1} is replaced by the 
conditioning by the event {W n > —1}. However, the so-called local conditioning by {W n = — 1} 
is often more difficult to analyze (see e.g. [2]). 

Before proving Theorem 1, we need to introduce some notation. For x = (xi, . . . ,x n ) G Z n , 
set M(x) — maxx<j< n Xj. Recall the notation V(x) for the Vervaat transform of x. Note that 
M(x) = M(V(x)). Let F : R — > R be a bounded continuous function. Recall that A(t n ) denotes 
the maximal out-degree of a vertex of t„. Since the maximal jump of W(t„) is equal to A(t„) — 1, 
it follows from the remark following Proposition 1.6 that: 



E[F(A(t n ))] 



E[F(M(V(X 1; X 2 , 
E[F(M(X 1 ,X 2 ,... 



..,X n )) + l)\W n = -l] 
,X n ) + l) \ W n = -l] 



(10) 



Recall that since /x satisfies Assumption (Hg), W\ belongs to the domain of attraction of a 
spectrally positive strictly stable law of index 2 A 9. Hence there exists a slowly varying function 
L such that W n / (L(n)n 1 ^ 2A6, - ) ) converges in distribution toward Y\. We set B n = L(n)n 1 ^ 2A ^ 

and prove that Theorem 1 holds with this choice of B„. The function L is not unique, but if L 
is another slowly function with the same property we have L(n)/L(n) — > 1 as n — > 00. So our 
results do not depend on the choice of L. Note that when /1 has finite variance a 2 , one may take 
B n = ay/n/2, and when L = c + o(l) one may choose L to be a constant function. 
We are now ready to prove Theorem 1. 

Proof of Theorem 1. If Z e ©([0, 1],M), denote by A(Z) = sup 0<s<1 (Z s — Z S S) the largest jump 
of Z. Since A : D([0, 1],M) — > M is continuous, from Proposition 2.2 we get that, under the 
conditioned probability measure P [ ■ \W n = —1], M (Xi,X%, . . . , X n ) jn converges in probability 
toward 7 as n — > 00. Assertion (i) immediately follows from (10). 

For the second assertion, keeping the notation of the proof of Proposition 2.2, we get by 
Proposition 2.1 that for every bounded continuous function F : D([0, 1],M) — > R 



lim 

n— >oo 



E 



L(n-l)tJ . 



< t < 1 



W r . 



771—1 



-E[F(Y t ,0<t<l)] 



0. 



'Ill 



The second assertion follows, since the jumps of fw^n-i)*] ' ^ — ^ — ^) have the same distribution 
as the out-degrees, minus one, of all the vertices of t n , except «*({.„). 
For (iii), if V n is as in the proof of Proposition 2.2, note that we have 



M(X 1 ,X 2 , 



X r 



777—1 



777, — 1 — w, 



n—1 



12 



on the event \W n = — lj. As noted in [2, Formula (2.7)], it follows from (11) that 



fM(X 1 ,X 2 ,...,X n )- 1 n 



W n = 1 n-l) A -Y x . 



n— >oo 



\ B, 

Since M(X 1; X 2 , ■ ■ ■ , X n ) = M(Xi, X 2 , . . . , X n ) + 7, we thus get that 

M(X 1 ,X 2 ,...,X n ) + l- 1 n w = _\ _^ 



B n 

Assertion (iii) then immediately follows from (10). This completes the proof. □ 

Remark 2.3. The preceding proof shows that assertion (i) in Theorem 1 remain true when /i is 
subcritical and both (1) and Proposition 2.1 hold. These conditions are more general than those 
of Assumption (Hg): see e.g. [10, Section 9] for examples of probability distributions that do not 
satisfy Assumption (Hg) but such that (1) holds. Note that assertions (ii) and (iii) in Theorem 1 
rely on the fact that fi belongs to the domain of attraction of a stable law. Note also that there 
exist subcritical probability distributions such that none of the assertions of Theorem 1 hold (see 
[18, Example 19.37] for an example). 

Recall that U(r) is the index in the lexicographical order of the first vertex of t n with maximal 
out-degree. We will need the following useful consequence of Theorem 1: 

Corollary 2.4. Fix c G (0,7). We have U(i n ) = min{j > 0; W j+1 (t„) - W^tJ > cn} with 
probability tending to one as n — > 00. 

This is an immediate consequence of Theorem 1. Indeed, assertions (i) and (ii) of the latter 
theorem imply that for every c G (0,7), there is a unique vertex of t n with out-degree at least 
cn, with probability tending to one as n — > 00. 

By applying the Vervaat transformation in continuous time to the convergence of Proposition 
2.2, standard properties of the Skorokhod topology imply the following invariance principle for 
the Lukasiewicz path coding t n (we leave details to the reader since we will not need this result 
later). See Fig. 4 for a simulation. 

Proposition 2.5. The following assertions hold. 

(i) We have: 

sup — > 0. 

o<i<t/(t„) n n ^°° 

(ii) The following convergence holds in distribution in D([0, 1],R): 

/ W LntJV( ^ (M+1) (t ra ) K1 \ S% ( 7 (i_i), <f<l). 

Using completely different techniques, a similar result has been proved in the context of condi- 
tioned random walks by Durrett [12, Theorem 3.2] when t n is a GW M conditioned on having at 
least n vertices and in addition /1 has finite variance. However, the so-called local conditioning by 
having a fixed number of vertices is often more difficult to analyze (see e.g. [11, 29]). Property 
(i) shows that (W^j (t n )/n, < t < 1J does not converge in distribution in D([0, 1],R) toward 
(7(1 — t), < t < 1) and this explains why we look at the Lukasiewicz path only after time U(tn) 
in (ii). 
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v 



0.2 0.4 0.6 0.8 



Figure 4: A simulation of a Lukasiewicz path of a large non-generic tree. 



2.2 Location of the vertex with maximal out-degree 

Our goal is to prove Theorem 2. We first need to introduce some notation. It is well known that 
the mean number of vertices of a GW^ tree at generation n is m n . As a consequence, we have 
[((t)] = l + m + m 2 + -- - = l/(l — m) = I/7. Moreover, for n > 1, by Kemperman's formula 
(see e.g. [30, Section 5]): 



1 



n 



n 



P[W n = -l] = -P [W n = 7n-1 



L(n) 



{in) 



1+8 ' 



12) 



where we have used (1) for the last estimate. It follows that the total progeny of a GW M tree 
belongs to the domain of attraction of a spectrally positive strictly stable law of index 2 A 6. 
Hence we can find a slowly varying function L' such that the law of (C(f) ~ n /l)/ {^L'{n)n l ^ 2 ^ e ^ 
under converges as n — > 00 to the law of Y\ , where we recall that P M j is the law of a forest of 
j independent GW M trees. We set B' n = L' [n)n l ^ 2/ ^ e \ The local limit theorem (see [15, Theorem 
4.2.1]) implies that if we set (pj(k) = P^j [£(f) = k] for j > 1 and k > (with the convention 
(fj(k) = for k < 0), then 



lim sup 



k — m/7 

m 



0. 



(13) 



where pi is the density of Y\. It is well known that pi is a bounded continuous function (see e.g. 
[34, I. 4]). 

The following technical result establishes a useful link between B n and B' n . 
Lemma 2.6. We have B'jB n ->■ l/ 7 1+1 /(2Ae) flS n _^ ^ 

The proof of Lemma 2.6 is postponed to the end of this section. 

Proof of Theorem 2 (i). Fix e > and an integer i > 0. By Theorem 1 (iii) and Lemma 2.6, we 
may choose A > such that: 



|P[tf(tn) = io] -P[tf(tn) = i , IMU < I < e 



(14) 
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By Theorem 1, for n large enough: 

|P [U(Q = 2 , IMU - -yn\ < AB' n ] - P [\k l0 (t n ) - 77i| < < e. (15) 

By Proposition 1.4, we have for n large enough: 
P[|fe i0 (f n )- 7 n| <AB' n ] 

= P [C(r 1 ) = n] S P[A 4o+1 =j-l,C = n] 

H P ^'rX\ i n1 ^ E [ 1 {Vm<io,W m >0}^ !0 +i(^ - io - 1)] , (16) 
\j--yn\<AB' n M LS V J J 

where we have used the Markov property of the random walk W at time iq + 1 for the last 
equality. To simplify notation, set j n (u) = \jn + uB' n \ for !i£l. Then write: 



<L p r^/ n - n E [ 1 {Vm<i ,W m >0}^iy io +i^ - «0 - 1 

|j-7n|<AB; /MS l / J 



E 
B'E 



l{Vm<i ,W m >0} / p fA/^ _ Zl <^ + L7™+«j(™ - »0 - l y 



Aa;+o(i) p [Xi = [7n + uj - 1] 
ab;-o(i) P^ [C(r) = n] 

M +0( i) P [x 1= j w ( M )_i] 
{ vm<i ,^>0} y_ A _ o(i) du P„[C(r)=n] ^o+i«(«)(" " »o - !) 



(17) 



By using the dominated convergence theorem, we will now show that for a fixed integer k > 

(18) 

We will then apply (18) with = Wj . Since /i satisfies assumption (ii) in (i?#)> we have 
P[Xi =j n (u) - 1] = £(j„,(M))/(j n (M)) 1+e . From (12), it follows that 



sup 

-A-Ku<A+l 



p[x 1= j n ( M )-i] _ 1 



^ [CM = n] 



n— >oo 



(19) 



Next, note that B' n /B' k+jn ^ — > \/^ l /^ e ) as n — > oo, uniformly in it G (—A — 1, A + 1). From 
(13), it follows that 

, . ^ 1 /n-z -l-i(A:+L7n + «^ n J)\ 
^^(n-zo-l)-^^^ I- ^ 0. 

Hence 

1 ( 1 M \ 

B n ^k+j n {u){n — «o — 1) ^ yT^P^"--^!^);- 

In addition, by (13), there exists a constant C > which is independent of k and such that 
< B' n Lpf :+ j n ( u }(n — i — 1) < C for every n sufficiently large and u G (— A — 1,A + 1). The 
convergence (18) then follows from an application of the dominated convergence theorem. 
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Another application of the dominated convergence theorem gives that the expression appear- 
ing in (17) converges toward 



/A 



7 7 



1/(2A0) 



as n — > oo. It follows from (14), (15) and (16) that for n sufficiently large: 



rA d u / 

P [U{Q = i ] - P [Vm < i , W m > 0] J a ^j^Pi 



u 



7 7 



1/(2A0) 



< 3e. 



Finally, since pi is the density of a probability distribution, we have: 



lim 

A-^-OO 



du 



y/(2A0) 



Pi 



■it 



(/;/ 



7 1/(2A0) 



Pi 



1 



7 7 



1/(2A0) 



7- 



7 7 V(2Ae) 

Thus, by choosing A sufficiently large, we get that for n sufficiently large 

|P [U (Q = io] - 7P [Vm < i , W m > 0] | < At. 

Since this holds for every e > 0, we get lim^oo P \U{l n ) = i ] = 7P [Vm < ioW m > 0]. By 
Proposition 1.4, we have P [Vm < io, W m > 0] = P M [C(t) > io + 1]- This completes the proof of 
assertion (i) in Theorem 2. □ 

To prove the second assertion of Theorem 2, we will need the size-biased distribution associ- 
ated with /i, which is the distribution of the random variable (* such that: 



F[C = k] :-. 



m 



fc = 0,1 



The following result concerning the local convergence of t ra as n — > 00 will be useful. We refer 
the reader to [18, Section 6] for definitions and background concerning local convergence of trees 
(note that we need to consider trees that are not locally finite, so that this is slightly different 
from the usual setting). 

Proposition 2.7 (Jonsson & Stefansson [21], Janson [18]). Let 7 be the infinite random tree 
constructed as follows. Start with a spine composed of a random number S of vertices, where S 
is defined by: 



F[S = t] = (1 -m)m l , 2 = 1,2,.... (20) 

Then attach further branches as follows (see also Figure 5 below). At the top of the spine, attach 
an infinite number of branches, each branch being a GW M tree. At all the other vertices of the 
spine, a random number of branches distributed as — 1 is attached to either to the left or to the 
right of the spine, each branch being a GW^ tree. At a vertex of the spine where k new branches 
are attached, the number of new branches attached to the left of the spine is uniformly distributed 
on {0, . . . , k}. Moreover all random choices are independent. 
Then i n converges locally in distribution toward T as n — ^ 00. 

Proof of Theorem 2 (ii) and (Hi). By Skorokhod's representation theorem (see e.g. [5, Theorem 
6.7]) we can suppose that the convergence t n — > 7 as n — > 00 holds almost surely for the local 
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Figure 5: An illustration of 7. Here, the spine is composed of the vertices 

0,1,13,13,131,1312. 



topology. Let G T be the vertex of the spine with largest generation. By (20), we have for 
% > 0: 

P[|^| = i] = (1 -m)m\ (21) 

Recall the notation U(t n ) for the index of u*(tn). Let e > 0. By assertion (i) of Theorem 2, 
which was proved at the beginning of this section, we can fix an integer K such that, for every 
n, P [U (t n ) < K] > 1 — e. From the local convergence of t„ to 7 (and the properties of local 
convergence, see in particular Lemma 6.3 in [18]) we can easily verify that 

p [K(y + u*} n {U(Q < K}} —> o. 

We conclude that P [w*(t n ) ^ u+] — > as n — > 00. Assertion (ii) of Theorem 2 now follows from 
(21). Similarly, the tree t n cut at ii*(tn) converges locally in distribution toward the tree 7 cut 
at it*, giving assertion (iii). □ 

Note that assertion (i) in Theorem 2 was needed to prove assertions (ii) and (iii). Indeed, the 
local convergence of t n toward 7 would not have been sufficient to get that P [M*(t n ) 7^ «*] — > 0. 
We conclude this section by proving Lemma 2.6. 

Proof of Lemma 2.6. Let a 2 be the variance of fx. Note that a 2 = 00 if 9 G (1,2), a 2 < 00 if 
6 > 2 and that we can have either a 2 = 00 or a 2 < 00 for 9 = 2. When a 2 = 00, the desired 
result follows from classical results expressing B n in terms of fi. Indeed, in the case 9 < 2, we 
may choose B n and B' n such that (see e.g. [26, Theorem 1.10]): 

B' n = inf {x > 0; P M [C(r) > g] < |) 
S n inf {x > 0; /i([x, 00)) < J} 

Property (ii) in Assumption (He) and (12) entail that [£(r) > x]//i([x, 00)) — >■ 1/7 1+61 as 
x — )■ 00. The result easily follows. The case when a 2 = 00 and 9 = 2 is treated by using similar 
arguments. We leave details to the reader. 
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We now concentrate on the case a < oo. Note that necessarily 9 > 2. Let a be the 
variance of C(r) under P M (from (12) this variance is finite when a 2 < oo). We shall show that 
a' = cr/7 3 / 2 . The desired result will then follow by the classical central limit theorem since we 
may take B n = and B' n = a'Jri/2. In order to calculate a' 2 , we introduce the Galton- 

Watson process (3i)«>o with offspring distribution \i such that 3o = !• Recall that E [3«] 
Then note that: 

a' 2 =E fl [ar) 2 ]-EMrf ~ 



m 



E 



E3, 

\i=0 



7 



2 ' 



Since (3i/m J )i>o is a martingale with respect to the filtration generated by (3i)i>cb we have 
E [ZiZj] = m?~ l "E [Zf]. Also using the well-known fact that for % > 1 the variance of 3i is 
l)/(m — 1) (see e.g. [3, Section 1.2]), write: 



a 2 m % 1 (m l 



E 



E3, 

\i=0 



oo 00 / 

EE K] +2 E tn^E[3 4 a ]=£E[3 4 a ](l + T 

i=0 0<i<? i=0 V 



2m 



\ i=i 

— + — 

^y2 



a^-^m* - 1' 



m 



m- 1 



+ m 



2/ 



1 + 



2m 



m 



This entails a' = a/7 3 / 2 and the conclusion follows. 



□ 



2.3 Cutting the tree at the vertex with maximum out-degree 

To analyze the structure of t n we shall cut t n into two subtrees, one being the subtree of the 
descendants w*(t n ) and the other being the subtree formed by the vertices which are not descen- 
dants of u*(t n ). A major difficulty is that these two subtrees are not independent. To overcome 
this difficulty, the idea is to use a different conditioning: we shall reduce the study of a tree 
conditioned on having total size n to the study of a tree conditioned by the event that its first 
vertex with strictly more than 7^/2 children has a fixed out-degree. The main advantage is that 
under this conditioning the two subtrees become independent. 

We need to introduce some important notation which will be used throughout this work. 
If r is a tree, let u* • (t) be the first vertex of r in lexicographical order with strictly more 
than 7n/2 children (with the convention u* (r) = d if there is no such vertex, where d is a 
cemetery point). Let K n (r) be the number of children of u^\t) (with the convention K n (r) = 
if u*(t) = d). Recall the notation T u t for the tree shifted at u. For 1 < j < K n (r), let 
u^\t) denote the j-ih child of (r) and set rj n ' ) = T^ n ){r). For 1 < i < j < n n {r), we set 

3^ = 3 r -"' ) (r) = (r t , . . . , t j -™' > ). By definition, a pointed tree is a tree with a distinguished 
vertex. Finally, if u G r, let Cut u (r) be the tree obtained from r by removing all the descendants 
of u and pointed at u. To simplify notation, set Cut^(r) = Cut („), At), with the convention 

Cut (n) (r) = d if u^\t) = d (that is if there is no vertex of r with strictly more than 7n/2 
children). Recall the notation F^j for the law of a forest of j independent GW^ trees. The 
following result easily follows from the properties of Galton- Watson trees: 
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Lemma 2.8. For every j > 1 such that P M [AC n (r) = j] > 0, under the conditional probability 
measure P M [ • | K n — j], the random variables Cut^ and are independent and, in addition, 
3ij is distributed according to P^j and Cut^ has the same distribution as Cu&^ under the 
conditioned probability measure [-\Cut {n) ^d]. 

We next show that under the conditioned probability measure P M [■ |Cut (ri) ^ d], the tree 

Cut^ converges locally in distribution toward Cut Ujt T, where 7 is the infinite tree appearing in 
Proposition 2.7. 

Lemma 2.9. Under the conditioned probability measure converges lo- 

cally in distribution toward Cut u J7 as n — > oo. 

Note that Lemma 2.9 is not a simple consequence of Theorem 2 (iii) since the events involved 
in the conditionings are not the same. Before we proceed to the proof of Lemma 2.9, we state a 
technical lemma. 

Lemma 2.10. Set j n (u) = |_7^ + uB' n \ for u G R. Then for every A > 0: 

jn(u)} 1 



lim sup 

n_>0 ° we [~A-1,A+1] 



0. 



IV [C(t) = n] 7 

Proof. By Proposition 1.4, for n sufficiently large, we have for every u G [—A — 1, A + 1]: 

P M Mr) = j„(u)] 



8=1 



*i < y> • • • < = Jn(u) - 1, W m > for 1 < m < i - 1 



P[Xi=j n (u)-l]£p 



i=l 



Xi < . . . < W m > for 1 < m < z - 1 



The monotone convergence theorem entails: 

P M [«„(r) = j„(w)] 



P[X!=j n (u)-l] 



E P t^m > for 1 < m < i - 1] 



uniformly in w G [—A — 1, A + 1]. By Proposition 1.4, the last sum is equal to : 

OO -1 

£ F m[C(t)>*]=E,[C(t)] = - 

i=l ' 



In addition, (19) gives that P [X\ = j n (u) — 1] /P M [C( r ) = n ] converges toward 1 as n — > oo, 
uniformly in u G [—A — 1, A + 1]. The conclusion immediately follows. □ 

Proof of Lemma 2.9. Fix a tree To pointed at a leaf. By Theorem 2 (iii), Cut Ui ( tn )(t n ) converges 
locally in distribution toward Cut Mt T as n — > oo. In addition, by Corollary 2.4, u* (t^) = w*(t„) 
with probability tending to one as n — > oo. It follows from Theorem 1 (iii) and Lemma 2.6 that 
(ftn(tn) — l n )/B' n converges in distribution as n — > oo. We can thus choose A > such that for 
every n sufficiently large: 



P[Cut Mtn) (t n ) 



P [Cut^(fn) = r , \K n (l n ) - 7 n| < AB' n 



< e. 



(22) 
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It is thus sufficient to show that, for an appropriate choice A, for n sufficiently large we have 



771 



|p[Cut (n) (t n ) =T ,Mt ri 

To this end, write 

P [Cut( n )(t„) = r , \n n {i n ) - <yn\ < AB' n 



< AB' n ] - P M [Cut (n) (r) = r |Cut (n) ^ d] \ < 2e. (23) 



W I K n = J 



P„ [Cut<"'(r) = r„|Cut<"> * 9] £ ^#^4r^(" -o'r„!l. 



|i-7n|<AS; 



, P/x [COO = n] 



(24) 



where we have used Lemma 2.8 for the last equality and recall the notation (pj(k) = P MJ - [C(f) = k]. 
Using Lemma 2.10, the same argument that led us to (18) shows that 



P M [«n = j] 



\p ^ ft 

^ v P.[C(r 



n 



■<Pj{n - C{t )) 



, A dM 7 l + l/(2A g ) Pl 



y+i/(2A0) 



)■ 



Since the term appearing in the right-hand side of the previous expression converges to 1 as 
A —7- 00, we may choose A sufficiently large so that 



1+1/(2A6») J 



7 

This implies that for n sufficiently large 

P„ [*n = j] 



7 



1+1/(2A0) 



- 1 



< e. 



E 



b -- 7 ,|<^ P m [COO = n] 
Then (23) follows from (24). This completes the proof. 



<Pi(n - C(t )) 



< 2e. 



□ 



2.4 Subtrees branching off the vertex with maximum out-degree 

We now turn to the proof of Theorem 3. By Corollary 2.4, we have u* (in) — v>*(tn) with 
probability tending to one as n — > 00. It is thus sufficient to establish Theorem 3 when the 
process Z is replaced by the process defined as follows. For every tree r and for every 
1 < j < ^n00> let C ( j l \ T ) = C( r j™' ) ) be the number of descendants of uj ; (t), and set Zj (t) = 
Cf n) (r) + d n) ( T ) + • • • + Cj n \r). We need to prove that: 

Z ^ ( ^" Mtw)t/7 ,0<t<l^ A (iy.,0<t<l). (25) 

Let (Zj)j>o be the random walk which starts at and whose jump distribution has the same 
law as the total progeny of a GW M tree. Note that P [Zj = k] = P w - [£(f) = fc]. Recall from 
the beginning of Section 2.2 that the distribution of Z\ belongs to the domain of attraction of 
a spectrally positive strictly stable law of index 2 A 6. In particular, if B' n is defined as in the 
beginning of Section 2.2, the following convergence holds in distribution in the space D([0, 1], R): 

Z ^- nt/ -l,0<t<l) ^> (Y t ,0<t<l). (26) 



B 
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Proof of Theorem 3. We shall show that for every fixed rj G (0, 1): 



B' 



0<t<7] 



(tn) 



n— >oo 



(y t ,o<*<77). 



(27) 



Recalling Lemma 2.6, our claim (25) will follow from a time-reversal argument since the vec- 



tors (^(tn),^^),,...,^^)^)) and (CagW-CiuHW'-'^W) haVe the 
distribution. Tightness follows from the time- reversal argument and also continuity at t = 1. 

Let F : D([0, ?i],R) — > M + be a bounded continuous function and to simplify notation set, for 

every tree r: 

Z E(r)tj( T ) -«n0")*/7 



u(n) 



(n) 



*(n) 



same 



5' 



.0 < * < 77 



Fix e > 0. As in the proof of Lemma 2.9, we can choose A > such that for every n sufficiently 
large: 



E„ [F (Z^\r)) I C(r) = n] - E„ [F (Z<"»>(t)) l {Mr) _ 7n |<^ } | C(r) = n] 



< e. 



(2? 



Now let Nq (t) = C( r ) + 1 — C (-^V™) 7 ") denote the number of vertices of r which are not 
strict descendants of ui n ^(r), so that NQ n \r) = C(Cut^ ri ' ) (r)). Without risk of confusion, in the 
sequel we write K n instead of K n (r), Nq instead of Nq U \t) and so on. Obviously {C( r ) = n} = 
{Zj^ + Nq 1 ^ = n}. To simplify notation, let (Z 3 -) 3 ->q and be independent under P and such 
that (Zj)j>o is distributed as explained before (26), and has the same law as Nq under 
P^ [• |Cut (n) ^ d]. Finally, set for k > 1: 



Z (k) 

By Lemma 2.8: 

E, [F (Z<"»>) l {lKn ^ nl <AB' n} 



< t < 77 



CO 



n 



E 

yn|<y 

E 



\j-Tn\<AB' n P M [C(^) = n 
[«n = J] 

„ P M [C(r) = n 



v y C(?i,j)+JVo n 



(29) 



Recall the notation yj(A;) = P[2j = fc]. To simplify notation, for integers j, n > we set 



ft 



(n) 



E 



. It follows from the Markov property of the random walk (Z>j)j>o 



applied at time that: 



ft 



(n) 



E 



^(Z W )Vi-Wj(n-^ n) -Zwj) 



Recall that j n (w) = ['jn + W-B^J for u £ R. Then the sum appearing in (29) is equal to: 



ab;+o(i) 



P, [COO 



n\ 



A+o(l) P„[« n = 7 n (w)l mW 



A+o(l) 



[C(r) 



(30) 
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We shall now show that there exist a, (3 > such that : 



-A+o(l) 



P. [CM 



ftV' . — ^ — / duE 



now, | - 1 



(31) 



where pi is the density of Y\. Recall that p\ is a bounded continuous function. 
We first show that there exist a, (3 > such that for fixed u G K: 



/3 



(32) 



To this end, we start by establishing a few useful convergences. Set a = (1 — ?^) 1 /( 2A6 '). By (26) 
and Lemma 2.9, we have the following joint convergence in distribution: 



Z ij > 



(«)) («)*>] -jn(u)vh ^(n) \ W 

3n(*0-|.»7.7»Wj 



(^)o<i<?7, , 

a 



2 N 



(33) 



where N is a finite random variable independent of Y. Note that for every sequence of real 
numbers (r n ) n >i such that r n — > oo as n — > oo we have ^ n) /r„ -»■ as n — )■ oo, in probability. 

Set /3 = (7(1 - ?7 )) 1 /(2Ae) > g ince B' n / B' jn(u) _ lvjn{u)i ->■ as n ->■ 00, it immediately follows 
that: 

n - - 3n{u)/l (p). 



Jf»(«0-L>Un(ti)J 



"07 



(34) 



We now establish (32). By the definition of !R. , ^ we have: 



B> 



TDI 

jn(u)-\rij n (u)\ 



E 



F (ZV"V») Bl iuyiWn{u)] VMu)-lvMu)\(n ~ # - Z lvjn{u)i ) 



To simplify notation, set G n = B' jn{uy[riMu)lVjn{u) _ lvjn(u)l (n-N^ ] - Z lvjn(u)i ). By (13), we have: 

n - - Zi vMu)] - Mjniu) - bu'n(w)J) 



lim 

n— s>oo 



It follows from (33) and (34) that: 
F (Z ij " {u)) ) G r . 



^jn(u)-[rij n (u)i 



0. 



In addition, by (13), there exists a (deterministic) constant C > such that < G n < C for 
every n > 1. Using the fact that ^/-Sj„( u )-Uj n ( u )j ~~ the convergence (32) follows from the 
dominated convergence theorem. 

Let us now establish the convergence (31). By Lemma 2.10, the convergence 



[«n = 3n(u)] 

^ [C(r) = n] 



7 



(35) 



holds uniformly in u G [—A — 1, A + 1]. Moreover, it is clear from (13) that B'JJ^™,-, is bounded 



uniformly in n > 1 and w G [— A— 1, A+l]. The convergence (31) then follows from an application 
of the dominated convergence theorem after taking into account (35) and (32). 
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By Fubini's theorem: 



1 

fa 



duE 



-E 



0<t<v) 



fa 

E[F((F t )o<t<,)] 



dupi 



u 
'fa 



Y 



where we have used the fact that p\ is a probability distribution in the second equality. Hence 
we can choose A > sufficiently large so that in addition to (28) we also have: 



fa 



duE 



F((Y t ) < t < v ) Pl 



u 

'fa 



Y 



E[F((Y t ) < t < v )} 



< e. 



By putting (29), (30) and (31) together, we get that for n sufficiently large: 



E, [F(Z^) |C(r 
This establishes (27) and completes the proof. 



n 



E[F((Y t ) < t < v )\ 



< 2e. 



□ 



Proof of Corollary 1. By Theorem 1 (i), A(t n )/n converges in probability toward 7 as n — > 00 so 
that B&( tn )/B n converges in probability toward 7 1 /( 2Ae ). In addition, the map Z 1— > sup sg ( ^(Z s — 
Z S J) is continuous on D([0, It follows from Theorem 3 that: 



■ max (Aim 

B n l<i<A(t„) SH 



W 1 a \r 

— y — sup Ar s 

7 «e[o,i] 



If 9 > 2, Y is continuous and the first assumption of Corollary 1 follows. If 9 < 2, the result 
easily follows from the fact that the Levy measure of Y is v(dx) = t{ x> o}dx/(T(— 9)x l+e ). □ 



2.5 Height of large conditioned non- generic trees 

We now prove Theorem 4. We keep the notation «*(t) of the Introduction, as well as the notation 



u 



(n) 



3j$ introduced in the beginning of Section 2.3. Recall also the notation P^j for the law 
of a forest of j independent GW M trees. If f = (ti, . . . , t^) is a forest, its height CK(f) is by 
definition max(IK(Ti), . . . , JK(Tfc)). 

Proof of Theorem 4- If r is a tree, let !K*^(r) = !K (T u („)T^J be the height of the subtree of 

descendants of in r. Recall that by Corollary 2.4, u* (in) — w*(t n ) with probability tending 
to one as n — >■ 00 and that by Theorem 2 (ii), the generation of u* (in) converges in distribution. 
It is thus sufficient to establish that, if (A n ) n >i of positive real numbers tending to infinity: 



P 



To simplify notation, set H$(t) = H(3§ } ) and (%> = C(3%>). Set p n = ln(n)/ ln(l/m) - A n . 
Let us first prove the lower bound, that is P ^K[ n \i n ) < p n — > as n — > 00. Fix < e < 7. As 
in the proof of Theorem 3, we can choose A > such that for n sufficiently large: 



ln(n) 



ln(l/m) 



r(n) 



< Xn 



(n) 



1. 



(36) 



P„ 



«n(r) - 7n 



> A 



C(r) 



< e. 
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Then for n large enough: 



P[^i n) (U <Pr 



<L ^ pn - 1 I C(T) = 



< 



n 



< E 
= E 

|j- 7 n|<AB; 



P„ 



^-1,Lk„/2J — Vni Kn — j, C( r ) 



n 



p. [CO 



JX l,Li/2j — Pnj — J, Si,[j/2J 



+ C 



+ e 



(n) 

b'/2j+ij 



+ iVr 



(n) 



n 



P, [COO 



+ e 



(n) 

To simplify notation, let fy/2\ and Kg be two independent random variables defined under P 
such that fu-/2j is distributed according to b'/ 2 J anc ^ ■^'o™' 1 nas the same law as Nq 1 ^ under 
P^ [• |Cut (n) ^ d]. Then, by Lemma 2.8: 



P, [CM 



rP„ 



n\ 



rw-( n ) „ _ • /-VJ i_ /-V*J \" 



(n) 



u(n) 



P^ [«n = j] 

F/x [COO = n 



P„ 



^-l,b72j ^ Pn, S>l,Li/2J + Hi/2J+I,j + iV o 



(n) 



r(n) 



n 



J] 



P„ [C(r) 



■E 



1 {w(f b / 2 j)<Pn} ' ^i-li/aj ( n ~ K o n) " C(fb'/2j) 



where we use the notation (pj(k) = [C(f) = fc] introduced in Section 2.2. It follows that: 



|j- 7 n[<AB; [C( r ) 



1 {M(f b/2J )< P „} ■ fs-ym ( n ~ K o n) - C(fb-/2j) 



+ e. 
(37) 

We now claim that it is sufficient to establish that for n sufficiently large, for every j G [yn — 
AB' n ,^n + AB' n ] and k G Z the following three estimates hold with some constants Cj and C^- 

Vi-Lj/aj(*)<Ci/B;. (38) 
P M [«« = J] < C^P^ [COO = n] (39) 
P[^(fb72j) <Pn] <e/A (40) 
Indeed, from (37), we will then get that 

P K n) (t«) < Pn] < ((2A + 1)0^/ A + l)e, 

and the lower bound will follow. 

The bound (38) is an immediate consequence of the local limit theorem (13). The estimate 
(39) follows from Lemma 2.10. Let us finally establish (40). Note that for n sufficiently large 
all the indices j appearing in the sum (37) satisfy j > (7 — e)n. Consequently, for n sufficiently 
large: 

P pKtfii/aj) < Pn] < (1 - P„ Mr) > p n ])^>M. (41) 

Since // satisfies Assumption (He), we have z ln(i)/ij < 00 . It follows from [14, Theorem 2] 
that there exists a constant c > such that: 



P„ pK(r) > fc] ~ c • m A 

fc— »oo 



(42) 
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Hence P M P~C(t) > p n ]) ~ c • l/(n • m A ") as n — > oo. The right-hand side of (41) consequently 
tends to as n — > oo. The estimate (40) follows, and the proof of the lower bound is complete. 

Set q n = ln(n)/ln(l/m) + A n . The proof of the fact that P bii n \i n ) > q v 
similar and we only sketch the argument. Write: 



— > as n — > oo is 



P K n) (t„) > q n ] < P M WL m > Qn | C(r) = n 



Under P^, nas the same distribution as ^i'^-i^ n /2j- ^ thus suffices to show that 

the first term of the last sum tends to as n — > 00. By similar arguments as in the proof of the 
lower bound, it is enough to verify that 



r(») 



■K 



[/ci/2j+l,K n ^ <?n I C( r ) - n 



P 



A1 ,L(7+e)n/2j 



0. 



This follows from (42), the fact that P M , L(7+e ) n / 2 j [IK(f) > q n ] = 1 - (1 - P M [5C(r) > g„]) L(7+e)n/2j 
combined with the asymptotic behavior P^ P~C(t) > g n ] ~ c • m An /n as n — )• 00. This completes 
the proof of the upper bound and establishes (36). □ 



2.6 Scaling limits of non- generic trees 



We turn to the proof of Theorem 5. Recall the notation u+, u 
Introduction and Section 2.3. 



(n) 



fir. 



and (Pj{k) from the 



Proof of Theorem 5. Fix rj G (0, l/ln(l/m)). We shall show that, with probability tending to 
one as n — > 00, at least \n(n) trees among the [K n (i n )/2\ trees of 3^™^ / 2 J (O have height at least 
r]ln(n). This will indeed show that, with probability tending to one as n — > 00, the number of 
balls of radius at most rj neaded to cover ln(n) -1 • t n tends to infinity. By standard properties 
of the Gromov-Hausdorff topology (see [7, Proposition 7.4.12]) this implies that the sequence of 
random metric spaces (ln(n) -1 • t n ) n >i is not tight. 

If f = (ti, . . . , Tj) is a forest, let E(f) be the event defined by 

E(f) = {at most ln(n) trees among T\, . . . , Tj have height at least r]\n(n)}. 

It is thus sufficient to prove that P E(3 r ^ K .^y 2 j (t„)) converges toward as n — > 00. Fix e > 0. 
As in the proof of Lemma 2.9, we can choose A > such that P [|ft n (t n ) — 771 1 > AB' n ] < e for n 



sufficiently large. It is thus sufficient to show that P E(3 r ^ K , 2 ,), 

n sufficiently large. We keep the notation f 1,-/21 and 'Nq 1 ^ introduced in the previous section. The 
same argument that led us to (37) gives 



\K 



n V^n) 



7 n| < AB' n 



< e for 



P 



C Wl,[K„/2J, 



MO -7^1 < AB' n 



E 

\j-jn\<AB; 



Pp, \j^r. 



J] 



, P. [CM 



E 



n 



E (fu/2j) -Vj-bVaj ( n 



(n) 



C(fL 



Using (38) and (39) we get the existence of a constant C > such that for every n sufficiently 
large 



P 



^l n/2i ),K(in)-in\<AB' n 



< 



C_ 
BL 



E P [E(f L 



J/2J. 



\j-jn\<AB' n 
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It is thus sufficient to prove that for n sufficiently large, if \j — 771 1 < AB' n then we have 
P [E(f [j/2j )] < e - To this end, write 



P [E(fb/2j)] = E J ) P m W > lM»)f C 1 - P . Mr) > riHn))) [ 



Note that \j — 771, | < implies that j < 2777, and j/2 — ln(n) > jn/3 for every n sufficiently 
large. Using (42) and setting if = r}]n(l/m), we get that for a certain constant C > 0, for every 
n sufficiently large, 



ln M / n\ 3/2-k / n \ 

P [E(fb/ 2 J)] < E i lnW (l-^j < Mn) • (2 T n) 1 - ■ exp , 



ln(n) / ^ N j/2-k 

E 

fc=0 

which tends to as n — > 00, uniformly in j. This completes the proof. □ 

Note that Theorem 5 implies that there is no nontrivial scaling limit for the contour func- 
tion coding t n , since convergence of scaled contour functions imply convergence in the Gromov- 
Hausdorff topology (see e.g. [27, Lemma 2.3]). 

2.7 Finite dimensional marginals of the height function 

To prove Theorem 6, we first need to extend the definitions of the height function and Lukasiewicz 
path to a forest. If f = (Tj)i<j<j is a forest, set n = and n p = C( r i) + C( r 2) + • • • + C( r p) f° r 
1 < V < j- Then, for every < i < p — 1 and < k < £(rj+i), set 

H„ i+fc (f) = H k (r l+1 ), W ni+fe (f) = W fc (r i+ i) - i. 

Note that the excursions of H(f) above are the (H ni+ fc(f); < k < C( r i+i)) and that the 
excursions of W(f) above its infimum are the (W„ i+ fc(f) — i\ < k < £(ri+i)). Finally, for j > 1, 
set Q = inf{n > 0; W n = —j} and for n > 0, set 

H n = Card({k e {0, 1, . . . , n - 1}; W k = inf WA) 

k<j<n 

We will also need the following extension of Proposition 1.4 (see e.g. [27, Proposition 1.7] for 
a proof): 

Proposition 2.11. Let j > 1 be an integer. Under f^j, (Hi(f), ^i(f))o<i<((f) has the same 
distribution as (Hi, Wi)o<i<£-. 

In particular, note that P MJ - [£(f) = n] — P [Cj = n]. Finally, the following result, which is an 
unconditioned version of Theorem 6, will be useful. 

Lemma 2.12. For every < s < t < 1 and for every b > 0, the following convergence holds in 
distribution: 

(h TT W Vnt-b\+int \ w , Y , 
\ H [ns _ hb H [nt _ bb ^— ) (ei,e 2 ,r t J 



\nt\ 



where e!,e 2 and Y t are independent. 
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Proof. To simplify, we suppose that 6 = 0, the proof being the same in the general case. For every 
k > 0, set M k = Card({l < i < k; Wi = max <j<j Wj}). Set also w\ n) = W\nt\-\ns\+i-W\_ n t\-\ns\ 
for i > 0. Notice that (W^ n \i > 0) has the same distribution as (Wj,i > 0). Define similarly 
il4 n) = Card({l < i < k; W t (n) = max <j<i wj n) }). Since (W u < % < n) and (W n - W n _ i; < 
i <n) have the same distribution, we have 



(d) 



B 



[nt\ 



M \ns\i M \nt\i 



B 



\nt\ 



It is thus sufficient to check that the last expression converges in distribution toward (ei, e2, Y t ). 
To this end, first set 



T = sup{i > 0, Wi = sup Wj}, T (n) = sup{i > 0, w/ n) = sup W (n) } 



which have the same distribution. Since W drifts almost surely to — oo, T is almost surely finite, 
and Mt is distributed according to a geometric random variable of parameter P [Vi > 1, Wi < — 1]. 
By [33, Theorem 1 in Chapter 2], we have P [Vi > 1, W t < -1] = -E [Wi] = 1 - m. 

Let F\,F 2 : Z — > M + be bounded functions and let G : K — > R + a bounded uniformly 
continuous function. Next choose No > such that P [T > iVo] < e. For i > No, note that 
M\ n) = M$ on the event < N and that M { = M No on the event T < N . Thus, for 
n > N /s, 



E 



Fi(Mgj)F 2 (M M )G 





-E 


\ B [nt} )_ 





< Ce 



where C > is a constant depending only on Fx, F 2 , G (and which may change from line to line). 
For n sufficiently large, using the fact that G is uniformly continuous, one also has 



E 



F 1 (M%)F 2 (M No )G 



B 



-E 



F 1 (Mfr)F 2 (M No )G 



W lnti -W No -W$+int 



B 



[nt\ 



< 2Ce. 



Using the fact that M No , and W\_ nt \ — Wn — are independent for n > N /(s A (t — s)), 
we get that, for n sufficiently large, 



E 



F^M^F^M^G 



w [nti - w No -w% + in t 



B 



[nt\ 



E 



F^Mft) E[F 2 (M No ))E[G(Y t )} 



is less than Ce. Since 



E 



F!(M (n) 



N 



-E 



Fi(M$ n) ) < Ce and |E [F 2 {M No )) - E [F 2 (M T )}\ < Ce, 



we finally conclude that for n sufficiently large, 



E 



F 1 (M«)F 2 (M 1 „ ij)G ( H i|±^- 



E 



FM n > n) ) E[F 2 (M T )}E[G(Y t )} 



< Ce. 



This completes the proof. 



□ 
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Remark 2.13. It is straightforward to adapt the proof of Lemma 2.12 to get that for every 
< t\ < £2 < ■ • • < tk < 1 and for every b > 0, the following convergence holds in distribution: 

Iff Tf ff W \nt k -b\ +int k \ jdy , 
\H [ntl _ bi ,H [nt2 _ bi ,...,H [ntk _ bi , — -> (ei,e 2 , . . . ,e k , Y tk ) 

\ n [nt k \ ) 

where {^\)\<%<k and Y tk are independent. 

We are now ready to prove Theorem 6. Recall the notation tt*(r) of the Introduction as well 

(n) 
: -,3 



as the notation u* 1 , K n , 3^ " , iVo and <Pj(k) introduced in the beginning of Section 2.3. 



Proof of Theorem 6. To simplify, we establish Theorem 6 for k = 2, the general case being 
similar. To this end, wefixO<s<t<l and shall show that 

(# Lns j(t n ),if Ln4 j(t„)) ^ (l + eo + ei,l + e + e a ). (43) 

Let F : Z 2 — > IR + be a bounded function. To simplify notation, if r is a tree set G(t) = 
F(H\£/ T \ S ^ (r), i?LC(T)t| ( r ))- As m the P ro °f °f Lemma 2.9, we can choose A > such that for 
every n sufficiently large: 

E [G(t n )] - E [G(t n )l { | Kn{tn) _ 7n |<^ } ] j < e. (44) 
It is thus sufficient to show that, for an appropriate choice of A, for every n sufficiently large, 

|E [G(t n )l {K(r) _ 7n |<^j] - E [F(l + e + e 1? 1 + e + e 2 )]| < e. (45) 
To this end, write 

p i K = j~\ 

E [G(t n )l { | Kn(tn) _ 7n |< AB;} ] = £ ^ " E^ [G(r)l {ar)=n} \K n = j] 

\j-rn\<AB' n r M LW ~ n \ 

Now, for every n > 1 and j > 1, let (S^ n \ Nj^, J% ) and fj be random variables such that: 

- (§("), X( n ),^ n) )„>i and (fj)j>i are independent, 

- fj is distributed according to P MJ -, 

- if T n is a tree pointed at vertex v n that has the same distribution as Cut^ under the 
conditioned probability measure P M [ • |Cut {n) ^ d] , then S< n > is the height of v n , is the 
number of vertices of % n which are less than or equal to v n in the lexicographical order, 
and finally is ((% n ). 

Then, by Lemma 2.8, we have 
E^ [G(r)l {c(T)= „ } |K„ = j] 



E 



F 1 1 + S (n) + H h _^)j (fc), 1 + S (n) + (fc) 1 1 



^ / "{^ n) +C(fj)=n} 
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By Lemma 2.9, converges in distribution toward the height of u+ in 7, which is distributed 
according to a geometric random variable of parameter 1 — m. Thus (45) will follow if we manage 
to check that, for fixed integers a, b > 0, 



E 



\j - 7 n|< AB' n 11 » 



K = j] 



-E [F (H Lr „_ 6J (f,-), Hl«*-*J (fi)) 1 



{a+C(fj)=n} 



E[F( ei) e 2 )] 



< e (46) 



for n sufficiently large and for an appropriate choice of A which will be specified later. To this 
end, using the notation introduced in the beginning of this section and Proposition 2.11, write 



{a+ah)=n} 



E [F(if|_ ns _6j,iif|_ nf _6j)l{^ =n _ a }] , 



Set p = ( 7 (1 -t)) l ^ 2Ae \P' = (t/(l-t))V(2A*) and j n (u) = [^n + uB' n \ for uGl. We claim that 
for fixed \u\ < A + 1 



(47) 



B> n E [F(H lns . bb H lnt . bl )l {( ; jn{u)=n . a} ] _^E[F( ei ,e 2 )] • E [± Pl (-/?' • K t - i0 



To establish this convergence we use the following Lemma, whose proof is postponed to the end 
of this section. 

Lemma 2.14. Fix A > 0. There exist two constants c, 5 > such that for every n > 1 and 
Id < A+l 



(z) P [W Vnt ^ + j» < 7 n(l - t)/2] < ce~ n \ (ii) P 



inf Wi < -i„(u) 

0<i<[nt-b\ 



<ce~ n . 



To establish (47), we use the Markov property of the random walk W at time [nt — b\ to get 

E [F(Hi n 8-b], Hi n t-b])^{(j=n-a}] 

= E [F(H [ns _ bl , H [nt _ bl )l {C:j>[nt _ bl} ■ ip Wlnt _ bi+j (n - a - [nt - b\)] . 

To simplify notation, set U n (j) = E [F(H Lns _ 6J , F Lnt _ 6 j )v 2 vy Lnt _ t , J +i(^ - o - |n* - )] • By Lemma 
2.14 (ii), for fixed \u\ < A+l, 



inf Wi < -j n (u) 

0<i<[nt-b\ 



B' n .F[C Mu) <[nt-b\]<B' n .F 
Thus (47) will follow if we manage to check that for fixed \u\ < A + 1 



0. 



B' n U n (j n (u)) 



E[F( ei ,e 2 )] -E 



/3 J 



Pi -/?' ■ Y t 



7/3 J 



(4E 



To this end, write 



B' n U n (j n (u)) 

= B' n E [F(H lns _ bl ,H lnt ^ bl )(p W[nt _ bi+jn{u) (n - a - [nt - &J)l{iy L „ t _ bJ +j«(«)<7n(i-t)/2}] (49) 
+K E [F(H\n 8 -b\, H lnt-b\)^w [nt _ bi +j n (u)(n -a - [nt- b\)t{W lnt _ bi +j n (u)>jn(l-t)/2}] 
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By the Skorokhod representation theorem, we may assume that the convergence appearing in 
Lemma 2.12 holds almost surely. Then (13) and Lemma 2.6 imply that 



B n F ( H [ns-b\, H lnt-b\)^W lnt _ bi +j n (u)(n -a - [nt- b\ )^{W lnt _ bi +Mu)>~,n(l-t)/2} 



(50) 



F(ei,ea)ip a - Y 



u 



and that the expression appearing in (50) is bounded uniformly in n > 1 and \u\ < A + 1. In 
addition, by Lemma 2.14, the expression appearing in (49) tends to as n — > oo. The convergence 
(48) then follows from an application of the dominated convergence theorem, and our claim (47) 
is proved. 

To complete the proof we have to check that (46) holds for n sufficiently large. To this end, 
using Lemma 2.14 (ii), write 



E 



\j-~fn\<AB> n 11 A 4 



gV K = j] 

, K [C(r) = n 



E [F (H Lns _ bJ (fj), H Lnt _ 6J (fj)) l {a 



+C(fj)=n} 



A+0{1) > „, ^ [«n = Jn(w)] , . . f . 

-a +oW duBn ^Ur) = n] B ^{u)) + o(l). 



(51) 



We claim there exists a constant C > such that B' n U n (j n (u)) < C for every n > 1 and 
|u| < A + 1. Indeed, using (13) and the fact that (B' n ) n >i can be chosen to be increasing, we 
have, for a certain constant c > 0, 

U n {j n {u)) < HFIUP [^ Lnt _ feJ + j n (u) < 77i(l - t)/2] + 



^ 7 n(l-t)/2 

Our claim then follows by Lemma 2.14 (i). Hence we are in position to apply the dominated 
convergence theorem. By (48) and Lemma 2.10, we get that as n — > oo, the expression appearing 
in (51) converges to 

1 ( „, u \ 



E[F(e 1 ,e a )] f duE 

J -A 



7/3 



Pi 



-13' ■ Yt 



7/3 J 



As in the end of the proof of Theorem 3, this last expression tends to E [F(e 1 ,e 2 )] as A — > oo. 
Consequently, by choosing A sufficiently large, (46) holds for n sufficiently large. This completes 
the proof of Theorem 6. □ 

It remains to prove Lemma 2.14. 

Proof of Lemma 2.14- To simplify, we suppose that 6 = 0, the general case being similar. To 
simplify notation, set a = (1 + (9)/2 A 2 and recall the notation W n = W n + 772 for n > 0. Note 
that E \Wi\ = and that E \W°i\ < 00. Next, using the inequality e~ x — 1 < — x + x a valid for 
every x > 0, we get that for every A > 



E 



1 < A°E [W°] . 



Hence there exists a constant c\ > such that InE 
using Markov's inequality we have 



-\Wi 



< C\\ a for every A > 0. Next, by 



P[WVj +j n {u)<in{l-t)/2 



P 



-AVK L „ tJ > -A( 7 n(l-t)/2+7Lra*J -•?«(>)) 



< e 



X(jn(l-i)/2+j[ntj-j n {u)) 



exp 



([ntj InE 



-XWi 
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But for n sufficiently large and for every \u\ < A + 1 we have 771(1 — t)/2 + 7 \nt\ — j n (u)) < 
—772(1 — £)/3. Hence, for n sufficiently large, taking A = l/n a , we get 

P [W [nt j + j n (u) < 7 n(l - t)/2] < e -^(l-*)/3+n tci A« < etCl -7(l-t)/3-ni-V«_ 

This establishes (i) since a > 1. 

For (ii), we apply Doob's maximal inequality with the submartingale (^e x<K ~ Wn \n > fj) to get: 



P 



inf W< < -j n (u) 

0<i< [nt\ 



inf W i < 1 [nt\ -j n (u) 

0<i< \nt\ 



< P 

< e A (7L«*J-in(«))]g 



p 



sup e 

0<i<|nij 



-AW, 



> e 



-\{f[nt\ -j n (u)) 



[nt\ 



< e 



A(7|ntJ-j„(u))+ntciA Q 



Taking A = l/n a and noting that 7 \ nt\ — j n {u) < —7/1(1 — t)/2 for every n sufficiently large, 
the conclusion follows. This completes the proof. □ 



3 Extensions and comments 

We conclude by proposing possible extensions and stating a few open questions. 

Other types of conditioning. Throughout this text, we have only considered the case 
of Galton- Watson trees conditioned on having a fixed total progeny. It is natural to consider 
different types of conditioning. For instance, for n > 1, let t„ be a random tree distributed 
according to P M [ • |!K(t) > n]. In [18, Section 22], Janson has in particular proved that when [i 
is critical or subcritical, as n — )■ 00, converges locally to a random infinite tree T*, which is 
different from T. It would be interesting to know whether the theorems of the present work apply 
in this case. 

Another type of conditioning involving the number of leaves has been introduced in [9, 26, 32]. 
If r is a tree, denote by A(r) the number of leaves of r (that is the number of individuals with 
no child). For n > 1 such that P M [A(r) = n] > 0, let t l n be a random tree distributed according 
to P M [ • |A(r) = n\. Do results similar to those we have obtained hold when in is replaced by t l n 7 
We expect the answer to be positive, since a GW M tree with n leaves is very close to a GW M with 
total progeny n/{i (see [26] for details), and we believe that the techniques of the present work 
can be adapted to solve this problem. 

Concentration of ^K(t n ) around ln(n)/ln(l/m). By Theorem 4, the sequence of random 
variable (JC(t n ) — ln(n)/ln(l/m)) n >i is tight. It is therefore natural to ask the following question, 
due to Nicolas Broutin. Does there exist a random variable J$? such that: 

K(«.)-J5$L <■% jf 7 
ln(l/m) n ^°° 

We expect the answer to be negative. Let us give a heuristic argument to support this prediction. 
In the proof of Theorem 4, we have seen that the height of !K(tn) is close to the height of \p(n\ 
independent GW M trees and the height of each of these trees satisfies the estimate (42). However, 
if {Qi)i>i is an i.i.d. sequence of random variables such that P [Q\ > k] = c- m fc , then it is known 
(see e.g. [17, Example 4.3]) that the random variables 

max(<2i, Q 2 , ...,Q n )- 

m(l/m) 
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do not converge in distribution. 

Behavior of E p-C(t n )]. We conjecture that: 

ln(n) 



E[M(tn)] 



ln(l/m) 



However, it seems that more precise estimates that the ones we have used are needed to prove 
this statement. 

Other types of trees. Janson [18] gives a very general limit theorem concerning the local 
asymptotic behavior of simply generated trees conditioned on having a fixed large number of 
vertices. Let us briefly recall the definition of simply generated trees. Fix a sequence w = (wk)k>o 
of nonnegative real numbers such that Wo > and such that there exists k > 1 with Wk > (w 
is called a weight sequence). Let Tf C T be the set of all finite plane trees and, for every n > 1, 
let T n be the set of all plane trees with n vertices. For every r G Tf, define the weight w(t) of r 
by: 

w(t) = n w ku{r) . 

Then for n > 1 set 

Z n = W ( T )- 

TgT n 

For every n > 1 such that Z n ^ 0, let 7 n be a random tree taking values in T n such that for 
every r G T n : 

P[T„ = r] = ^. 

The random tree T n is said to be finitely generated. Galton- Watson trees conditioned on their 
total progeny are particular instances of simply generated trees. Conversely, if 7 n is as above, 
there exists an offspring distribution fi such that T„ has the same distribution as a GW M tree 
conditioned on having n vertices if, and only if, the radius of convergence of J2 w i z% ls positive 
(see [18, Section 8]). 

It would thus be interesting to find out if the theorems obtained in the present work for 
Galton- Watson trees can be extended to the setting of simply generated trees whose associated 
radius of convergence is 0. In the latter case, Janson [18] proved that 7 n converges locally as 
n — > oo toward a deterministic tree consisting of a root vertex with an infinite number of leaves 
attached to it. We thus expect that the asymptotic properties derived in the present work will 
take a different form in this case. We hope to investigate this in future work. 
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